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Abstract 

In this paper, we propose a novel structural correla¬ 
tion filter combined with a multi-task Gaussian particle fil¬ 
ter (KCF-GPF) model for robust visual tracking. We first 
present an hieraichical structure where several KCF track¬ 
ers as weak experts provide a preliminary decision for a 
Gaussian particle filter to make a final decision. The pro¬ 
posed method is designed to exploit and complement the 
strength of a KCF and a Gaussian particle filter. Com¬ 
pared with the existing tracking methods based on correla¬ 
tion filters or particle filters, the proposed tracker has sev¬ 
eral advantages. First, it can detect the tracked target in a 
large-scale search scope via weak KCF trackers and eval¬ 
uate the reliability of weak trackers ’decisions for a Gaus¬ 
sian particle filter to make a strong decision, and hence it 
can tackle fast motions, appearance variations, occlusions 
and re-detections. Second, it can effectively handle large- 
scale variations via a Gaussian particle filter. Third, it can 
be amenable to fully parallel implementation using impor¬ 
tance sampling without resampling, thereby it is convenient 
for VLSI implementation and can lower the computation¬ 
al costs. Extensive experiments on the OTB-2013 dataset 
containing 50 challenging sequences demonstrate that the 
proposed algorithm performs favourably against 16 state- 
of-the-art trackers. 

1. Introduction 

Visual tracking is one of the most fundamental problems 
in computer vision due to its numerous applications such as 
video surveillance, motion analysis, vehicle navigation and 
human computer interactions. Although a great progress 
has been seen on developing algorithms [19, 39, 40, 46] 
and benchmark evaluations [43] for visual tracking, visu¬ 
al tracking is still a challenging problem in the situations of 
heavy illumination changes, pose deformations, partial and 
full occlusions, large scale variations, background clutter 


and fast motion. 

Correlation filters have recently attracted a great atten¬ 
tion due to their rapid speeds of calculation and robust track¬ 
ing performance [24, 27, 28, 29]. Bolme et al. [6] proposed 
an adaptive correlation filer, called MOSSE, for producing 
ASEF-like filters by fewer training images. Henriques et al. 
[3 ] extended the correlation-filter-based trackers to kernel- 
based training, called CSK method, to utilize a circulant 
structure of one image patch to conduct dense sampling, and 
then improved the KCF tracker [19] by using multi-channel 
inputs and HOG descriptors. Danelljan et al. [14] develope- 
d the DSST method handling scale changes of a target, and 
Choi et al. [1 ] proposed a spatially attentional weight map 
to weight various correlation filters. Ma et al. [30] used a 
correlation filter as a short-term tracker and an online ran¬ 
dom fern classifier for re-detection as a long-term memory 
system. 

Although achieved the appealing results both in preci¬ 
sion and success rate, these correlation-filter-based tracker- 
s cannot deal with fast motions and scale variations well. 
For example, although two correlation-filter-based tracker- 
s, namely SCT[10] and KCF [19], have achieved state-of- 
the-art results and have beaten all other attended trackers in 
terms of accuracy in the OTB-2013 dataset [43], they fail to 
track a target object when partial occlusions or illumination 
changes occur. 

To deal with the above issues, we propose a novel and an 
ensemble tracker, which first builds weak trackers by apply¬ 
ing structural correlation filters, and then integrate all weak 
trackers into one stronger tracker using a reliability evalu¬ 
ation and a multi-task Gaussian particle filter. Each weak 
tracker is treated as an expert and the weights of all experts 
are computed via their confidence maps. Through the relia¬ 
bility evaluation, the weak trackers provide weak decision- 
s for the Gaussian particle filter to make a final decision. 
The tracking result in the current frame is interred by the 
weights of the Gaussian particle filter. Particles respective¬ 
ly calculate two results using HOG and gray-normalization 
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features, and therefore the Gaussian particle filter conducts 
a multi-task tracking for making a strong decision. 

The contributions in this paper are summarized as fol¬ 
lows. 

• A novel ensemble algorithm is proposed to combine 
structural correlation filters and a Gaussian particle fil¬ 
ter into a single stronger tracker. 

• A large-scale search scope algorithm using for multi¬ 
ple structural correlation filters is proposed, consider¬ 
ing spatial geometric relations between target locations 
in consecutive frames and therefore making the search 
reliable. 

• Extensive experiments in the OTB-2013 benchmark 
dataset [43] with 50 challenging sequences and 11 var¬ 
ious attributes to demonstrate the outperformance of 
the proposed method in comparison with 16 state-of- 
the-art trackers. 

2. Related Work 

A comprehensive tracking review can be found in [43, 
44, 3 ]. In this section, we discuss the methods closely 
related to this work, mainly regarding multi-task correla¬ 
tion filters, Gaussian particle filters and Average Peak-to- 
Correlation Energy (APCE). 

2.1. Structural Correlation Filters 

Qi et al. [34] described the weak correlation filers on 
CNN features in each layer and Liu et al. [2 ] proposed the 
concept of the structural correlation filter. 

In [34], X k G R Px Q xD denote the feature map extracted 
from the k- th convolutional layer with Gaussian function 
label Y E R PaQ . Let X k = T(X k ) and >' = F(Y), where 
!F{-) represents the discrete Fourier transformation (DFT). 
The objective function of correlation filter method [3 2 ] can 
be extended into its k- th filter modeled as 

W fc = argmin \\y - X k • W\\ 2 F + X\\W\\ 2 F , (1) 

w 

where 

D 

x k -w = XAwO v 'w (2) 

d= 1 

Here, the symbol 0 is the element-wise product. 

The optimization problem in Eq. 1 has a simple closed 
form solution, which can be efficiently computed in the 
Fourier domain by 

W *’*’d= X*.X'' + \ O *»■*■<*• (3) 

Given the testing data T k from the output of the k- th lay¬ 
er, we first transform it to the Fourier domain T k = F{T k ), 


and then the responses can be computed by 

S k =T~ 1 (T k -W k ), (4) 

where T~ x denotes the inverse of DFT. 

The fc-th weak tracker outputs the target position with 
the largest response 

(x k ,y k ) = aigmaxS k (x',y'). (5) 

x',y' 

2.2. Gaussian Particle Filters (GPF) 

Kotecha and Djuric [2 ] introduced the Gaussian Particle 
Filter (GPF), which is used for tracking filtering and pre¬ 
dictive distributions encountered in Dynamic State-Space 
models (DSS) [18]. The DSS model represents the time- 
varying dynamics of an unobserved state variable. GPF is 
based on the particle filtering and Gaussian filtering con¬ 
cepts. Gaussian filters provide Gaussian approximation- 
s to the filtering and predictive distributions, and they in¬ 
clude Extended Kalman Filter (EKF) [21] and its variations 
[20, 31, 37, 42]. Unlike EKF, which assumes that predic¬ 
tive distributions are Gaussian and employs linearization of 
the functions in the process and observation equations, GPF 
updates the Gaussian approximations using particles. GPF 
only propagates the posterior mean and covariance of an 
unobserved state variable in a DSS model, and essentially 
importance sampling makes the procedure simple. 

Particle filters [16, 32, 33] use sequential importance 
sampling (SIS) [2< ] to update the posterior distributions. 
GPF is quite similar to SIS filters by the fact that impor¬ 
tance sampling is used to obtain particles. However, a phe¬ 
nomenon called sample degeneration occurs wherein only 
a few particles representing the distribution have significant 
weights. A procedure called resampling [2 ] has been intro¬ 
duced to mitigate this problem, but it may give limited re¬ 
sults and may be computationally expensive. Since GPF ap¬ 
proximates posterior distributions as Gaussians, unlike the 
SIS filters, particle resampling is not required. This results 
in a reduced complexity of GPF as compared with SIS with 
resampling and is a major advantage. Furthermore, Berzui- 
ni et al. [5] reported that particle filters with resampling also 
had bias due to resampling, and resampling in SIS filters is a 
nonparallel operation. Fortunately, resampling would nev¬ 
er occur in GPF simulation examples, and the particle filters 
in GPF are amenable to parallel implementation. Therefore, 
GPF is more amenable for fully parallel implementation in 
very large scale integration (VLSI) than SIS. 

Simulation results are presented to demonstrate the ver¬ 
satility and improved performance of GPF over convention¬ 
al Gaussian filters and the lower complexity of GPF than the 
known particle filters. However, the parallelizibility of GPF 
and the absence of resampling makes it convenient for VLSI 
implementation and, hence, feasible for practical real-time 
applications. 



2.3. Ensemble trackers 

Multiple component trackers have been combined with 
hand-crafted features to develop ensemble tracking methods 
[1, 2,4] ] for visual tracking. For example, several ensemble 
methods [1,2] using a boosting framework [15] train each 
component weak tracker to classify foreground objects and 
background. In [4 ], Wang and Yeung used a conditional 
particle filter to infer a target’s position and the reliability 
of each component tracker. Qi et al. [34] treated tracking 
as a decision-theoretic online learning task and the tracked 
target was inferred by using the decisions from multiple ex¬ 
pert trackers. Similar to [34], we consider visual tracking 
as a decision-theoretic online learning task [9], and use it in 
the structure of multiple correlation filters combined with a 
Gaussian particle filter. That is, in every round, each corre¬ 
lation filter makes a decision and the final decision is deter¬ 
mined by a Gaussian particle filter. 

2.4. Average Peak-to-Correlation Energy (APCE) 

For correlation filters, the peak value F max denotes the 
maximum response score of a response map. To measure 
the fluctuation degree of a response map and the reliabili¬ 
ty degree of a detected object, Wang et al. [40] proposed 
an average peak-to-correlation energy (APCE) which is de¬ 
fined as 


APCE = 


I f —F \ 

I ± max 1 mm \ 


( 6 ) 


mean | ^ ( F w h F m i n 

\w,h , 


where F max , F m i n and F w ^ denote the maximum, min¬ 
imum and the w -th row and h -th column elements of 
F(s,y;w). APCE indicates the fluctuated degrees of re¬ 
sponse maps and the confidence degrees of the tracked re¬ 
sults. For sharper peaks and less noise, in the case that the 
target fully appearing in a tracking region, APCE will be¬ 
come greater and the response map will become smoother 
except for only one sharp peak. On the other hand, APCE 
will be small if an object is occluded or missing. 


3. Proposed Algorithm 

In this section, we present the combination of structural 
correlation filters with a multi-task Gaussian particle filter 
for ensemble tracking, namely KCF-GPF. Different from 
the KCF method [19, 3 ] that learns a single correlation fil¬ 
ter in a fixed-size area, KCF-GPF is proposed to construct 
multiple weak correlation filters in a more reliable search 
scope for dealing with fast motion issues and bound effects 
in the conventional correlation filters. The Gaussian parti¬ 
cle filter jointly learns particle weights based on different 
features to make a stronger tracker by a multi-task method. 
Furthermore, our tracker can effectively handle scale vari¬ 



Figure 2. Illustration of the target tracking using the weak struc¬ 
tural correlation filter in sequence Deer from OTB-2013 dataset 
[ -3]. The search scope is based on the max historical moving dis¬ 
tance of the object in horizontal and vertical directions at time t, 
namely d% and d\, and considers the last-frame object location as 
the center. A search window slides in the search scope to enclose 
a sample corresponding to a weak correlation filter. 


ations via the sampling strategy of a Gaussian particle fil¬ 
ter. Overall, the proposed ensemble method will achieves 
the following two goals: 1) weak expert trackers are tuned 
to separate a foreground object from background and 2) the 
ensemble as a whole ensures the temporal coherence of each 
part of the tracker. 

3.1. Weak Structural Correlation Filter 

A conventional correlation filter has bound effects [13] 
during a target tracking and it interferes with the progress of 
target detection with fast motions. For this, we extend the 
conventional search window for a single tracker to a large- 
scale search scope for multiple trackers, and exploit spatial- 
geometric relations between target locations in consecutive 
frames to make the search scope reliable. 

The feature map X k in Eq. 1 is extracted from an image 
patch which is sampled by a search sliding window (see Fig¬ 
ure. 2), where d t donates the maximum historical-moving- 
distance of a tracked target in Eq. 17 at time t and the object 
location in the previous frame is the center of the search s- 
cope. Besides it, we respectively use d% and d\ to represent 
the horizontal and vertical moving distances at time t. 

Given the initial confidence weights of all weak experts, 
in the current round, a further decision is made based on 
the expert tracker with the greatest weight. In the visual 
tracking scenario, it is natural to treat each KCF tracker as 
an expert and then predict the target position at time t by 

(X* , y*t) = (Xt, Vt ) • [1 - Sign( | w'l - max |)], (7) 

where w k is the weight of expert k and Ylk=i w t = 1> H 
denotes the absolute value and sign represents the signum 
function. 

Kernel Selection: We choose the Gaussian kernel in the 
existing correlation filter tracker [19]. 
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Figure 1. Flowchart of the proposed algorithm. The proposed algorithm consists of three components: 1) constructing multiple weak 
trackers using correlation filters where each one is trained using HOG features and makes a weak decision (Section 3.1); 2) evaluating the 
reliability degree of each weak decision via maximum response score and APCE measure and the most reliable decision is used for the 
next step (Section 3.2); 3) constructing a stronger tracker and making a final decision via a multi-task Gaussian particle filter which takes 
the most reliable decision into account (Section 3.3). 


Feature Representation: Similar to [19], we use HOG 
features with 31 bins. However, our tracker is quite generic 
and any dense feature representation with arbitrary dimen¬ 
sions can be incorporated. 

Compared to the HDT [34] and SCF [27] methods which 
are similar to the proposed weak structural correlation filter, 
we demonstrate differences among these approaches as fol¬ 
lows. 

1. HDT uses CNN features, while SCF and KCF-GPF are 
based on HOG features. 

2. The features of HDT are extracted from one layer to 
build a weak tracker, and the part-based correlation fil¬ 
ter SCF samples several parts of a target object to con¬ 
struct features, while KCF-GPF samples via sliding 
window in a search scope which is subject to the maxi¬ 
mum historic-moving-displacement of the tracked tar¬ 
get over time t. 

3. In HDT, the target position is made based on the 
weighted decisions of all experts and SCF solves the 
optimization problem using the fast first-order Alter¬ 
nating Direction Method of Multipliers (ADMM) [7], 
while KCF-GPF exploits Eq. 7 to infer the ultimate 
target position. 

3.2. Reliability Degree of Correlation Filter 

The peak value and the fluctuation of the response map 
can reveal the confidence degree about the tracking results 
to some extent. The ideal response map should have only 
one sharp peak and be smooth in all other areas when the 


detected target is extremely matched to the correct target. 
The sharper the correlation peaks are, the better the loca¬ 
tion accuracy is. Otherwise, the whole response map will 
fluctuate intensely, and its pattern is significantly different 
from normal response maps. If we continue to use uncertain 
samples to update the tracking model, it would be corrupted 
mostly. 

Inspired by [40], we evaluate the stability of the k- th ex¬ 
pert with two criteria. The first criterion is the maximum 
response score F max of the response map defined as 

Fmax = ma xS k (x', y') (8) 

x',y' 

where S k (x f , y') is referred to Eq. 4 and Eq. 5. 

The second criterion is called average peak-to- 
correlation energy (APCE) measure which is defined in Eq. 
6. 

3.3. Multi-task Gaussian Particle Filter 

The proposed multi-task Gaussian particle filter at time 
t approximates the posterior mean fi t and covariance T, t of 
the unknown state variable x t using Bayesian importance 
sampling. 

We draw samples from the importance function i r(-) at 
time t using 

7r(xt|y 0 :t) = A/"(x t ; /l i u E t ), (9) 

and denote them as {x^}j l l 1 . Here, yo : * is the observations 
over time t, and Af(-) represents a Gaussian function. Note 
that, in Eq. 9, /l i t is equal to Eq. 7 and Si is chosen based 
on prior information. 





The respective weights are computed by 


3.4. KCF-GPF Tracker 
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wi = 


P(y«|Xf )^V(xt = 

7r(Xt|yO:t) 


( 10 ) 


where the distribution p( y t |x^) represents the observation 
equation y t conditioned on the unknown state variable 
at time t. 

Eq. 10 can be rewritten as follows from Eq. 9: 

W 3 t ocp(y t |xj). (11) 

Then, we setp(yt|x|) = |/* — /'(x))|, where |-| denotes 
the absolute value and f t is the function of features, where 
/ t * represents the features of the template at time t. Hence, 
each Gaussian particle weight can be calculated with 


w t \ ft - /( x t) I 

Normalize the weights as 


w J + 


w J + 


\r-^M 

2^7 = 1 


( 12 ) 


(13) 


In this paper, we adopt HOG for tackling deformation 
variations and gray normalization of raw pixel values from 
sample images for handling illumination changes, and the 
j -th features are represented by fp ho( p and fp norrr p at time 
t , respectively . Hence, we get the corresponding weights 


w 


t(hog) 


and wl, 


t(norm) ’ 


For the multi-task Gaussian particle filter, we jointly de¬ 
fine the similarities of respective samples as 


w J t =0- w° t{hog) + (1 - 9) • u> J t{norm) , (14) 

where learning rate parameter 0 is a constant value in this 
paper. 

The mean and covariance are estimated by 


M 


m = 

(15) 

3 = 1 


M 


-x^)(|it -x^) H , 

(16) 


3 = 1 


where H represents the Hermitian Matrix. 

The maximun historical-moving-distance of the tracked 
target at time t is defined as 


dt = max(|/L Li - fii- 1 | 

i=l 


(17) 


Figure 1 illustrates the flowchart of the proposed algo¬ 
rithm. Based on the structural correlation filter and a Gaus¬ 
sian particle filter, we propose a KCF-GPF tracker. The first 
step generates K weak correlation filters. The second step is 
to make weak decisions via the weak structural correlation 
filter, s The third step is to evaluate the reliability degrees of 
weak decisions. Finally, the optimal decision is made using 
a multi-task Gaussian particle filter. 

To update the KCF-GPF for visual tracking, we adopt 
an incremental strategy in the current frame to update the 
template in Eq. 12 by 

n=p- fu + a - P ) • as) 

where learning rate parameter p is a constant value in this 
paper. 

An overview of the proposed method is summarized in 
Algorithm 1. 


Algorithm 1: KCF-GPF tracking algorithm 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 


Input: Frames 

Output: Target location of each frame fj, t . 
for Time t = 1 : T do 

if t = 1 then 

Initialize the target location 0*i, tf); 

Crop interested image region; 

Initiate K weak correlation filters using Eq. 3; 

else 

Compute each correlation filter’s response 
using Eq. 4; 

Find target position predicted by each weak 
tracker using Eq. 5 ; 

Evaluate the reliability degrees APCE and 
Fmax via Eq. 6 and Eq. 8; 

if max APCE and max F macc 


n 

12 

13 

14 

15 

16 

17 

18 end 


end 


satisfy the condition then 
| Draw samples from Eq. 9; 
else 

| Draw samples from Eq. 9 where T, t = 3; 

end 

Compute the ultimate position using Eq. 15; 
Update the template via Eq. 18; 


4. Experiments 

4.1. Experimental Setups 


Here, we set d\ = 0, and | • | denotes the function for abso- Implementation Details. The conventional features used 
lution values; for KCF-GPF are composed of HOG features and gray 










Table 1. Parameters of KCF-GPF 


Part of Track 

parameters 

values 


padding 

1.2 

KCF 

Feature bandwidth cr 

0.5 


Adaptation rate 

0.045 


Sample numbers N 

200 

GPF 

Learning rate 6 (Eq. 14) 

0.65 


Learning rate p (Eq. 18) 

0.8 


normalization features. Our tracker is implemented on 
MATLAB in a PC with a 2.80 GHz CPU and runs faster 
than 21 FPS in Table 2. Our tracker requires few parameter 
settings, reported in Table 1, where ‘padding’ (referring to 
KCF [19]) means the magnification of the image region 
samples relative to the target bounding box. 

Datasets. Our method is evaluated in the OTB-2013 
dataset [43] consisting of 50 sequences. The images 
are annotated with ground truth bounding boxes and 11 
various visual attributes include scale variation, out of view, 
out-of-plane rotation, low resolution, in-plane rotation, 
illumination, motion blur, background clutter, occlusion, 
deformation, and fast motion. 

Evaluation Metrics. We compare the proposed method 
with the 16 state-of-the-art tracking methods using evalua¬ 
tion metrics and code provided by the respective benchmark 
dataset. For testing on OTB-2013, we employ the one-pass 
evaluation (OPE) and use two metrics: precision and 
success plots. The precision metric computes the rate 
of frames whose center location is within some certain 
distance from the ground truth location. The success metric 
computes the overlap ratio between the tracked and ground 
truth bounding boxes. In the legend, we report the area 
under curve (AUC) of success plot and precision score at 
a 20 pixel threshold (PS) corresponding to the one-pass 
evaluation for each tracking method. 

4.2. Comparison with State-of-the-Art 

We evaluate KCF-GPF in the OTB-2013 dataset [43] and 
compare it with 16 state-of-the-art trackers including LM- 
CF [40], CFNet [39], CFN [11], CFN [11], CNT [46], BIT 
[8], SINT [38], SCT [10], Staple [3], SiamFC [4], SRD- 
CF [13], DSST [1 ], MEEM [45], KCF [19], TLD [2: ] 
and Struck [17]. Among them, LMCF, CFN, CFN_, Sta¬ 
ple, SRDCF, KCF, DSST, CFNet and SCT are CF based 
algorithms; SiamFC, SINT, CFNet, CNT and BIT are con¬ 
volutional networks based algorithms; MEEM is developed 
based on regression and multiple trackers; TLD is based on 
an ensemble classifier; and Struck is structured SVM based 
methods. 

The characteristics and tracking results are summarized 
in Table 2. The mean FPS here is estimated on all sequences 


in the OTB-2013 and achieves 21.3 fps satisfying the re¬ 
quirement of real-time capability. LMCF achieves the sec¬ 
ond best performance in terms of the success metric and 
SINT shows the second best performance in terms of pre¬ 
cision metric. Figure 5 illustrates the precision and success 
plots of all trackers under all challenging attributes in the 
OTB-2013. KCF-GPF is also superior to other up-to-date 
trackers with precision and success evaluation metrics in the 
OTB-2013 benchmark. 

For detailed analyses, we also evaluate KCF-GPF with 
these state-of-the-art trackers on various challenging at¬ 
tributes in the OTB-2013 benchmark dataset and the results 
are shown in Figure 3 and Figure 4. The results demon¬ 
strate that KCF-GPF is ranked on top three in each attribute 
and achieves the best performances in the general success 
plots. Besides that, the proposed method outperforms other 
trackers in terms of deformation and out-of-plane rotation 
attributes. 

4.3. Qualitative Comparison 

To demonstrate the effect of the proposed KCF-GPF 
tracking algorithm, we make a qualitative comparison with 
above state-of-the-art trackers in the OTB-2013 benchmark 
dataset with 11 different attributes. As shown in Figure 6, 
all trackers perform well overall, but the existing trackers 
have the following drawbacks. The SCT does not perfor¬ 
in well under scale variations (Liquor, Woman, and Dogl). 
The CFNet cannot handle occlusion (Lemming, Skating 1, 
Subway, Singer2, Suv, Liquor, Woman and Soccer), defor- 
mation(Skatingl, Subway, Singer2, Suv and Woman) ,out- 
of-plane rotation (Lemming, Skating 1, Singer2, Liquor, 
Woman and Soccer) and background clutters (Skating 1, 
Subway, Singer2, Suv, Liquor and Soccer). The KCF drifts 
when there are illumination variations (Shaking, Lemming 
and Woman), scale variations (Shaking, Lemming, Wom¬ 
an and Dogl), out-of-plane rotations (Shaking, Lemming, 
Woman and Soccer), and fast motions (Woman and Soc¬ 
cer). The TLD and Struck methods drift when target objects 
undergo illumination changes (Shaking, Skating 1, Singer2 
and Soccer), heavy occlusion (Lemming, Subway, Singer2, 
Suv, Liquor, Woman and Soccer) and scale variations (Lem¬ 
ming and Dogl). Overall, the proposed KCF-GPF tracker 
performs the best against the existing trackers in tracking 
objects on these challenging sequences. 

5. Conclusion 

In this paper, we have proposed a novel tracker combin¬ 
ing multiple structural correlation filters with a multi-task 
gaussian particle filter, namely KCF-GPF, to construct a 
strong tracker for ensemble tracking. The proposed method 
takes multiple correlation filters as weak expert trackers, 
and exploits spatial-geometric relations between target lo¬ 
cations in consecutive frames to provide weak decisions in 











Table 2. Tracking results of all 17 evaluated trackers over all 50 sequences using OPE evaluation in the OTB-2013. The entries in red 
denote the best results and the ones in blue indicate the second best. 


Trackers 

OPE 

LMCF[40] 

CFNet[39] 

CFN[1 1] 

CFN [11] 

CNT [46] 

BIT[8] 

SINT[38] 

SCT[10] 

Staple[3] 

precision 

0.842 

0.803 

0.813 

0.784 

0.723 

0.816 

0.851 

0.836 

0.793 

success 

0.800 

0.775 

0.675 

0.630 

0.656 

0.745 

0.791 

0.730 

0.754 

Trackers 

OPE 

SiamFC[4] 

SRDCF[ ] 

DSST[E] 

MEEM[45] 

KCF[ ] 

TLD[: ] 

Struck[17] 

KCF-GPF(ours) 

mean FPS(ours) 

precision 

0.809 

0.838 

0.740 

0.840 

0.740 

0.608 

0.656 

0.857 

21.3 

success 

0.783 

0.781 

0.670 

0.706 

0.623 

0.521 

0.559 

0.805 







Success plots of OPE - occlusion (29) 



^^LMCF(CVPR2017) [0.634] 

— KCF-GPF(ours) [0.630] 

.... BIT(TIP2016) [0.628] 

SINT(CVPR2016) [0.612] 
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CFN(CVPR2017) [0.572] 

— CFNet(CVPR2017) [0.565] 

— ■ CFN-(CVPR2017) [0.537] 
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V 
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Figure 3. Success plots over all 50 sequences using OPE evaluation in the OTB-2013 dataset. The evaluated trackers are LMCF, CFNet, 
CFN, CFN , CNT, BIT, SINT, SCT and KCF-GPF. All 11 tracking challenges include scale variation, out of view, out-of-plane rotation, 
low resolution, in-plane rotation, illumination, motion blur, background clutter, occlusion, deformation, and fast motion. The numbers 
in the legend indicate the average AUC scores for success plots. Our KCF-GPF method performs favorably against the state-of-the-art 
trackers. 


a reliable search scope. The reliability degrees of weak de¬ 
cisions are introduced in experiments for the GPF to make 
a strong decision. As a result, it not only has the advantages 
of the existing correlation filter trackers, such as, computa¬ 
tional efficiency and robustness, but also can deal with scale 
variations by the sampling strategy of a GPF. The proposed 
KCF-GPF tracking algorithm outperforms 16 state-of-the- 
art methods over all 50 sequences in the OTB-2013 bench¬ 
mark in terms of qualitative and quantitative evaluations. 
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Figure 4. Success plots over all 50 sequences using OPE evaluation in the OTB-2013 dataset. The evaluated trackers are Staple, SiamFC, 
SRDCF, DSST, MEEM, KCF, TLD, Struck and KCF-GPF. All 11 tracking challenges include scale variation, out of view, out-of-plane 
rotation, low resolution, in-plane rotation, illumination, motion blur, background clutter, occlusion, deformation, and fast motion. The 
numbers in the legend indicate the average AUC scores for success plots. Our KCF-GPF method performs favorably against the state-of- 
the-art trackers. 






Figure 5. Precision and success plots over all 50 sequences using OPE evaluation in the OTB-2013 dataset. The numbers in the legend 
indicate the average precision scores for precision plots and the average AUC scores for success plots. Our KCF-GPF method performs 
favorably against the state-of-the-art trackers. 
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in our evaluation on 10 challenging sequences (from left to right and top to down are Shaking, Lemming, Skatingl, Subway, Singer2, 
Suv, Liquor, Woman, Soccer, Dogl, respectively). 
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